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ABSTRACT 



A tool at the service of a distributed application running on 
machines of a distributed data processing system running in 
a local area network, intended for balancing the load on each 
of the machines of the system, includes a master daemon and 
a plurality of agent demons. The master and each of the 
agents calculate the load of the machine on which they are . 
running. The master collects the load data of each of the 
agents at a first sampling interval and sends that collected 
load data to all of the agents. At the request of the distributed 
application, the local agent closest to the application indi- 
cates to the application which machine has the Hghtest load. 
The application then makes the decision to request the 
machine with the lightest load to execute the services the 
application requires. As necessary, the tool selects a master 
fi-om the agents, thereby ensuring the existence and unique- 
ness of a master at all times, regardless of failure affecting 
one 1010 or more machines in the data processing system. 

20 Claims, 1 Drawing Sheet 
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DISTRTOUTED APPLICATION LOAD 
DISTRIBUTION AD) TOOL 

DISTRffiUTED APPLICAnON LOAD, 
DISTOIBUTION AID TOOL 

The present invention relates to a toolkit for balancing the 
load of an application distributed among several machines 
belonging to a distributed data processing system in a local 
area network. 

HELD OF THE INVENTION 

ITie current trend in the development of data processing 
systems is to form a data processing system through the 
association of a plurality of machines connected to one 
another through a network, for example a local area net- 
work. Any user can run applications of widely varied types 
on this set of machines. These applications call services 
which supply the information required to handle the problem 
or problems they are working on, which are offered by all or 
some of these machines. 

When an application in the process of running requires the 
use of a particular service, in current practice, it proceeds in 
the following manner: 
either it chooses, in a purely random manner, the machine 
which will provide it with this service and assigns the 
work to this machine, 
or it makes a circular choice among all the machines, 
which means that it assigns by turns, always in the 
same chronological order, the work of providing the 
services it requires in succession: thus, if the system has 
three machines, it assigns the work of providing the 
services it requires first to machine No. 1, those it 
requires second to machine No. 2, those it requires third 
to machine No. 3, and so on in the following order: No. 
1, No. 2, No. 3, No. 1, etc. 
No matter which of the two cases described above applies, 
none of the work of any of the machines is optimized in 
terms of time, and the capabilities of these machines in terms 
of speed and performance are used only at a level far below 
their maximum. 

There are some known solutions which make it possible 
to eliminate these drawbacks: one of these is described in 
French patent application No. 94 08764, filed Jul. 13, 1994 
by the Assignee of the subject application, under the title 
"Open Data Processing System with Multiple Servers". In a 
system of this type, formed by the association of a central 
system called a client with several servers, each server 
calculates its own load based on the criteria specific to each 
application running on the client, as well as its foreseeable 
development over time, and transmits these two factors to 
the client. The latter, when a particular application requires 
the services of a server, chooses the one with the lightest 
load during the period of time in which the services must be 
rendered and assigns it the work of supplying the services 
requested. 

SUMMARY OF THE INVENTION 

The present invention constitutes an improvement and a 
generalization of the preceding solution. 

According to the invention, the tool at the service of a 
distributed application running on the machines of a distrib- 
uted data processing system in a local area network, which 
is intended for balancing the load on each of these machines, 
is characterized in that it comprises a plurality of data 
processing modules called DAEMONS which run on these 
machines, one of which is called the master, the others being 
agents. 
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the master and the agents each having means for calcu- 
lating the load of the machines on which they are 
running, at first predetermined sampling instants, and 
means for storing the load data of the master and the 
5 agents, 

the master containing: 
means for collecting the load data of each agent, at 

second predetermined sampling instants, 
means for sending the load data of each agent to all of 
10 the other agents, 

each agent containing: 

means for receiving the load data of the other agents, 
the local agent closest to the application indicating to 
the latter, at the request of the latter, the machine 
with the lightest load, the application then making 
the decision to request this machine to execute the 
services it needs. 

BRIEF DESCRIPTION OF THE DRAWING 

20 

Other characteristics and advantages of the present inven- 
tion will become apparent from the following description 
given as a non-limiting example in reference to the 
appended drawings. In these drawings: 
25 FIG. 1 shows a distributed data processing system which 
includes the load balancing toolkit according to the 
invention, 

FIG. 2 shows the controller for distributing the master- 
agent roles among the various elements which constitute the 
3Q load balancing toolkit according to the invention. 

DESCRIPTION OF THE PREFERRED 
EMBODIMENTS 

1) ESSENTIAL CHARACTERISTICS OF THE 
35 TOOL ACCORDING TO THE INVENTION 

A) Structure: 

The various essential characteristic elements of the load 
balancing toolkit ORC for (load balancing toolkit, in 
English) (load balancing toolkit, in English) in a distributed 

40 data processing system according to the invention — for 
simplicity's sake, from this point forward, it will be called 
the "toolkit"— appear in FIG. 1. 

As shown in FIG. 1, the distributed data processing 
system, which can be any type whatsoever, here called SID, 

45 comprises four data processing machines of any size and 
shape whatsoever, namely MCO, MCI, MC2, MC3. Each of 
these machines — small-, medium, or large-scale 
computers — comprise the usual elements, namely one or 
more central processor unit, called CPU , memories asso- 

50 ciated with the latter, input/output units 0/0 units,) and 
means for connecting to the network RE. These means are 
symbolically represented by two-way arrows which repre- 
sent the data links between the four machines MCO through 
MC3 in FIG. L 

55 The toolkit ORC itself comprises the master MSO and the 
three agents Al, A2, A3. Any agent can also be a master, 
depending on conditions which will be explained below. 
Both the master and the agents arc constituted by data 
processing tools known to one skilled in the art as DAE- 

60 MONs. A DAEMON is a data processing tool or entity 
running on a machine, which is capable of responding to a 
question. 

Inside each of the machines MCO through MC3, the 
DAEMONS MSO, Al, A2, A3 are respectively associated 
65 with shared memories MPO, MPl, MP2, MP3. Each of them 
contains the load of the coaesponding machine, but also the 
loads of the other machines in SID. 
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FIG. 1 assumes that the distributed application is running 
on the machine MC2 and that it requires services supplied by 
the other machines MCO, MCI, MC3. This application is 
designated APU. The places where the master and the agents 
are located are independent from the place where APU is 
running. 
B) Operation: 

The following are the main lines of operation of the tool 
ORC, it being understood that during the establishment of 
communication between all the machines in the system SID, 
it is assumed to be established that MSO is the master and 
that Al, A2, A3 are the agents. Refer again to FIG. 1, and 
specifically to the arrows and the circled numbers which 
accompany them, which respectively indicate the direction 
of the information flowing between the master and the 
agents, and the sequence of operations. 

OPERATION 1: Each agent as well as the master collects, 
for the machine on which il is running, at given time 
intervals which constituted the first determined sampling 
instants ti, the load data of this machine, for each of the 
elements which constitute it (the CPU load, the loads of the 
associated menaories, the loadls of the I/Os, the network load, 
etc.). From the load of each element, expressed as a per- 
centage of its maximum allowable load, the total load of the 
machine in question is calculated. This is accomplished by 
load calculation means, respectively MCCO for MSO, MCCl 
for Al, MCC2 for A2, MCC3 for A3. These means are 
simply constituted by calculation programs which imple- 
ment the load calculation method described below, in para- 
graph 2: "Method for load calculation by each of the 
agents."These means are naturally an integral part of the 
master and of each agent, MSO, Al through A3 and for this 
reason they are not represented per se in FIG. 1, for 
simplicity's sake. Once the total load of the machine in 
question is calculated, a set of statistical data on the load of 
this machine, namely DSC, is obtained. In FIG. 1, this 
operation is shown for the agent A2 only, obviously for the 
purpose of preserving the clarity of this figure. 

OPERATION 2: At regular intervals, the agents send MSO 
the statistical load data from the corresponding machine, 
through the network (for A2, through the link L2 between 
MC2 and MCO), 

OPERATION 3: The master MSO centralizes, at practi- 
cally the same regular intervals defined for Operation 2, 
which constitute the second determined sampling instants 
Ti, all the statistical load data of all the agents, as well as its 
own, at the level of its associated shared memory, in this 
case MPO. This centralization is in fact an operation for 
collecting the load data. It is therefore executed by load data 
collection means, respectively MRCO for MSO, MRCl for 
Al, MRC2 for A2, MRC3 for A3, which are in fact 
collection programs integrated into the master and into each 
of the agents Al through A3 and are therefore not repre- 
sented in FIG. I for simplicity's sake. 

OPERATION 4: The master MSO sends, using sending 
means MTCO, all this data to each agent Al, A2, A3 through 
the network RE, namely through the links LI between MCO 
and MCI, L2 between MCO and MC2, L3 between MCO and 
MC3. MTCO is an integral part of MSO and is therefore not 
represented in FIG. 1 for simplicity's sake. 

OPERATION 5: Each agent receives this load data and 
copies it into its associated shared memory, MPl for Al, 
MP2 for A2, MP3 for A3. This is accomplished by the means 
MRCCl through MRCC3 for Al through A3 respectively, 
which means are integral parts of the latter and are not 
represented in FIG. 1 for simplicity's sake. 

OPERATION 6: The application APU scans the shared 
memory of the machine on which it is running to search it 
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35 



for the load estimated for each of the machines, and at the 
moment it needs the determined services to be rendered, it 
deduces the machine with the lightest load at this moment 
and requests the latter to render it these services. 
2) METHOD FOR LOAD CALCULATION BY EACH OF 
THE AGENTS: 

Examples of loads on the CPU, memory, Input/Output, 
and network RE elements are described below. 

The description of the method for load calculation by each 
of the means MCCO through MCC3 is given in reference to 
Tables 1 through 4, which appear below and in which the 
loads are given as percentages. 

TABLE 1 



45 



50 



55 





Sample load data 






stored in any shared memory associated with 


an agent Cexpressed as a %) 




tl 


t2 t3 


t4 


15 t6 t7 


CPU load (Wl) 


35 


12 42 


73 


92 65 33 


Memory load (\V2) 


45 


32 33 


67 


46 32 40 


Network load (W3) 


12 


6 33 


20 


12 38 5 


Input/output load (W4) 


25 


30 56 


46 


78 44 32 


TABLE 2 


synthesis of series of clobal load data for each machine 


tl 


t2 


13 t4 


t5 


t6 t7 


MCO 56 


32 


67 63 


79 


82 54 


MCI 23 


34 


45 56 


67 


62 79 


MC2 32 


38 


34 42 


35 


32 36 


MC3 96 


94 


79 82 


74 


79 68 


TABLE 3 






extrapolation 






of the value of the global load after a time T for each machine 


tl t2 


t3 


t4 tS 


t6 


t7 tS - t7 + T 


MCO 56 32 


67 


63 79 


82 


54 estimated 73 


MCI 23 34 


45 


56 67 


62 


79 estimated 82 


MC2 32 38 


34 


42 35 


32 


36 estimated 36 


MC3 96 94 


79 


82 74 


79 


68 estimated 73 


TABLE 4 


application of the power coefBcicnt, comparison and selection 






MCO 


MCI 


MC2 MC3 


Estimated capacity 




73 


82 


36 76 


Power coeflScient 




2.5 


2 


0.8 1.5 


CoeflEcient of the available 67.5 


36 


51.2 36 



capacity 

(100 - estimated load) ' 
power ooefiicicnt 



60 



65 



The calculation of the load for each agent and master is 
identical to that described in the above-mentioned patent. Il 
is briefly summarized here. 

The total load Wt of an agent (and also of the master) is 
obtained using the following formula: 

Wt-kl*Wl+k2*W2+k3-W3+k4*W4, in which: 

Wl is the percentage of the utilization of the centra! 
processor of the agent in terms of lime. 
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W2 is the percentage of the utilization of the memory of the input/output units, etc. It is recalculated each time the 

the agent, that is, the ratio between the storage capacity hardware configuration of the machine is changed or its 

actually used and its total capacity. . operating system is modified. Likewise, each time the gen- 

W3 is the percentage of the utilization of the network by eral configuration of the distributed data processing system 
the agent, that is, the ratio between the number of 5 siD is changed, all the coefiBcients CP of all the machines in 

pieces of information sent and received by the agent the system are redefined. A Cp equal to 1 corresponds to a 

and the maximum allowable rate in the network. medium-scale machine, which is defined by the user. 

W4 is the percentage of the utilization of the input/output in Table 4, it is possible to read examples of actual 

units by the agent. available capacity rates CI for each machine MCO through 

kl, k2, k3, k4 are specific weighting factors of the MC3. Thus, for MCO, with an estimated capacity rate of 73 

processor, the memory, the network, and the input/ and a power coefiBcient Cp of 2.5, the actual available 

output units. Their sum is equal to 1. Their values capacity rate is 67.5. The same figures are respectively 82, 

depend on the nature of the application in the process 2, 36 for MCI, and so on for MC2 and MC3. 

of running in this ^ APU on the machine MC2 3^ METHOD FOR SELECTING THE MASTER MSO: 

The loads Wl, W2, W3, W4 are measured and Wt is ' 

calculated as shown in each of the tables in y^eadix 1, at ^h^ ^^sic philosophy is that any DAEMON running on 

determined sampling instants tl, t2, t3, t4, t5, t6, t7, etc. of . machine can be a master. It is therefore important to 

the period T (which are, in fact, the instants ti mentioned develop a mechanism which makes it possible to define 

above in the description of Operation 1). ^^^^^ master and the conditions for 

Table 1 gives an example of load data collected by any its selection, as well as the modalities for its replacement 

agent, for example Al, relative to the corresponding should it fail. 

machine MCI, for all the instants tlthrought7. This data is. The selection mechanism must ensure that at least 1 

of course, stored in the shared memory MPl of the machine DAEMON Ls running and that two of them cannot be 
MCI on which Al is running, before it is sent to MSO. ^5 masters simultaneously (especially if they start up at the 

This table shows, for example, that Wl Ls equal to 35 at same time") 

the instant tl , W2 to 67 at the instant t4, W3 to 38 at t6, W4 , . . ^ . . 

to 32 at l7 and so on composed of the following 5 main phases: 

A calculation program API associated with APU, which Phase 1: When a DAEMON starts up, it generates a 

runs 00 MC2, then applies— for the load data of each agent unique identifier ID in conformity with the protocol used in 

and the master which, after the execution of Operation 3, is the network RE, for example in conformity with the TCP-IP 

contained in the shared memory MPO of MCO associated protocol used in the exemplary embodiment of the invention 

with MSO— the weighting factors kl through k4 specific to described herein. This identifier is composed of the Ethernet 

the corresponding machines for the application APU. address (Ethernet is the part of the TCP-IP protocol related 

Thus Table 2 is obtained, which shows, for each of the to local area networks and is used in the exemplary embodi- 

machines MCO through MC4, the global load value Wt at ment described herein. Ethernet being understood to be 

the instants tlthrough t7. Thus it may be seen that, for MCO, standardized and therefore known to one skilled in the art), 

Wt is equal to 56 at the instant tl, 32 at t2, 67 at t3, etc. For the moment at which the identifier is transmitted, and a 

MCI, Wt is equal to 23 at tl, 34 at t2, etc., and so on for the random value. At the same time, it puts itself into an 

other machines. intermediate state and sends these two pieces of information 

The following step for calculating the load for all the (its current state, ID) through the network RE to all the 

machines consists of estimating, by extrapolation, using the machines in the network. 

known mathematical method of least error squares, the value Phase 2: it waits to receive the identical information from 

of the estimated load Wt at the instant t8=(t7+T). the other DEMONS, for determined a lime interval Tr (on 

Thus, Table 3 is obtained. This table makes it possible to the order of 5 to 10 seconds). It is a candidate for the role 

read, for example, that the estimated values of the loads of of master. 

MCO through MC3 at this instant t8 are 73, 82, 36, and 76, phase 3: As soon as it receives this information, it 

respectively. analyzes it. 

Next, a power coefiBcient Cp specific to each machine is ,^ . ^ ^ . ,^vt ^- 

applied to the total load of each machine in order to obtain ^^.^ ^ ^''^ ^ '° ' u'' 

its actual available capacity rate CI, using the formula: '° '^^^ considered to be MSO, it considers itself to be 

an agent. 

ci-(ioo-wtCesiiinated))'Cp If it comes from a DAEMON in an intermediate state, it 

^ , , . , . , , compares its own identifier with the one it receives. 

In effect, it is important to take into account the charac- ,^ . , . . . 

teristics of each machine, given that this is a heterogeneous 55 ^'^^^^^ 

data processing environment wherein the power, the size and ''^^^ "^^^^^^ ^^0. 

the type of the machines which compose it are different. If its own identifier is higher than or equal to the one it 

Thus, if a machine has a light load but at the same time does receives, it cedes the position. It then retransmits the 

not have enough processing power to provide the services two pieces of information (its own ID, its state) and 

requested of it by the application APU at a given moment, 60 ^g^i^ waits for responses during the time interval Tr, 

it is obvious that another machine must provide these ^so called the timer. 

services. Hence the necessity for a correction factor in order Phase 4: Once this time interval has elapsed, the DAE- 

to define the load, and hence the existence of the power MON in question tries again. In order to avoid a loss of 

coefiBcient CP which corresponds to this purpose. messages, which is always possible in the network RE, it 

The coefficient Cp of a given machine is calculated by 65 i^ses the following procedure: 

generating a synthesis of the power of the central processor The transmission and the Listening for responses are 

CPU, the capacity of the memories, the processing power of repeated 5 times. 
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If the DAEMON in question receives a response from 
another DAEMON which reveals itself to be an agent, 
it can be sure that a master MSG exists and it waits for 
the response from the latter to reach it. 
Phase 5: When 5 repetitions have taken place, and the ^ 
DAEMON in question has not received any response from 
the other DAEMONS, it then decides that it is the master 
MSG. 

When one of the three agents Al through A3 realizes that 
the master MSO is no longer communicating with it, it starts 
the procedure below in all its phases, which results in the 
choice of a new master chosen from among the three. 

Moreover, the master periodically notifies all the 
machines in the system SID of its existence. If the master 35 
detects the existence of another master, the procedure is 
restarted by the one whose ID is lower. 

FIG. 2, which shows the controller AUT which distributes 
the master-agent roles among the various DAEMONS run- 
ning on the machines of SID, will make it easier to under- 20 
stand the sequence of the different phases 1 through 5 
described above. 

This controller AUT comprises 5 stales: 

State 10: The DAEMON in question sends the two pieces 
of information (its own ID, its state), which corre- 
sponds to Phase 1. 

State U: This DAEMON listens for responses from the 
other DAEMONS, which corresponds to Phases 2 and 

3. 30 

State 12: This DAEMON waits for the time interval Tr to • 
elapse, and for a possible response from the master 
MSO. 

State A: The DAEMON in question becomes an agent Al, 
A2 or A3, 

State M: The DAEMON in question becomes the ma.ster 
MSO. 

The events which correspond to this controller, which are 
denominated el through e8, are the following: 40 
el: The DAEMON in question has disseminated its ID 

and its state and has set a time interval Tr. 
e2: An ID is received, and the local identifier ID (that of 

the DAEMON in question) is lower than the identifier 

it receives. 

e3: An ID is received, and the local ID is greater than or 

equal to the ID received. 
e4: The time interval Tr has expired. 
e5: The time interval Tr has expired and the number of 
attempts is less than 5, or an agent has just responded. 
e6: The master has just responded. 
e7; The time interval Tr has expired, the number of 

attempts is equal to 5, and no agent has responded. 55 
e8: The connection with the master is lost. 
e9: Detection by a master of the existence of another 

master with a higher ID. 
While this invention has been described in conjunction 
with specific embodiments thereof, it is evident that many 
alternatives, modifications and variations will be apparent to 
those skilled in the art. Accordingly, the preferred embodi- 
ments of the invention as set forth herein, are intended to be 
illustrative, not limiting. Various changes may be made $5 
without departing from the true spirit and full scope of the 
invention as set forth herein and defined in the claims. 
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TABLE 1 



Sample load data 

stored in any shared memory associated with an agent (expressed as a %) 





tl 


12 


t3 


t4 


15 


t6 


t7 


CPU load (Wl) 


35 


12 


42 


73 


92 


65 


33 


Memory load 0V2) 


45 


32 


33 


67 


46 


32 


40 


Network load CW3) 


12 


6 


33 


20 


12 


38 


5 


Input/output load (W4) 


25 


30 


56 


46 


78 


44 


32 



TABLE 2 



synthesis of series of global load data for each machine 





tl 


12 


t3 


t4 


t5 


t6 


t7 


MCO 


56 


32 


67 


63 


79 


82 


54 


MCI 


23 


34 


45 


56 


67 


62 


79 


MC2 


32 


38 


34 


42 


35 


32 


36 


MC3 


96 


94 


79 


82 


74 


79 


68 



TABLE 3 



extrapolation 

of the value of the global load after a time T for each machine 





tl 


t2 


t3 


t4 


t5 


t6 


17 


tS - t7 + T 


MCO 


56 


32 


67 


63 


79 


82 


54 


estimated 73 


MCI 


23 


34 


45 


56 


67 


62 


79 


estimated 82 


MC2 


32 


38 


34 


42 


35 


32 


36 


estimated 36 


MC3 


96 


94 


79 


82 


74 


79 


68 


estimated 73 



TABLE 4 



applicatioQ of the power coefficient, comparison and selection 

MCO MCI MC2 MC3 



Estimated capacity 


73 


82 


36 


76 


Power coefficient 


2.5 


2 


0.8 


1.5 


Coefficient of the available 


67.5 


36 


51.2 


36 


capacity 










(100 - estimated load) * 










power coefficient 











We claim: 

1. A tool (ORC) at the service of a distributed application 
(APU) running on machines (MCO through MC3) of a 
distributed data processing system (SID) in a local area 
network (RE), intended for balancing a load on each of said 
machines, said tool comprising a plurality of data processing 
modules (MSO, Al through A3) caUed DAEMONs which 
run on said machines, one of which is a master (MSO), the 
others being agents (Al through A3), 

the master (MSO) and the agents (Al thorough A3) having 
respective means (MCCO through MCC3) for calculat- 
ing at first determined sampling instants ti the load of 
the machines on which the DAEMONs are running and 
respective storage means (MPO through MP3) for stor- 
ing the load data of the master and the agents, 
the master (MSO) containing: 

means (MRCO through MRC3) for collecting load data 
of the respective agents at second determined sam- 
pling instants Ti, 
means (MTCO) for sending the load data of each agent 
to all of the agents, 
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each agent (Al through A3) containiag: 

respective means (MRCCl through MRCC3) for 
receiving the load data of the other agents, whereby 
at a request of the application the local agent closest 
to the application indicates to the application which 
machine has the lightest load, the application then 
making the decision to request said machine with the 
lightest load to execute services the application 
requires. 

2. The tool according to claim 1, further comprising 
selection means (AUT) for choosing a master from the 
DAEMONS, which ensures the existence and the uniqueness 
of said master at the start-up of said DAEMONS and after 
loss of a master during the execution of the tool (ORG). 

3. The tool according to claim 1, further comprising 
means (AUT, e8) for ensuring continuity of the service 
rendered to the application by the tool in case of a failure 
affecting at least one machine in the data processing system. 

4. The tool according to claim 1, further comprising 
means (AUT, el. State 10) for automatically detecting from 
the network which machines will allow the network to 
retrieve the addresses of all the machines at the start-up of 
the DAEMONS. 

5. A process for implementing a tool at the service of a 
distributed application (APU) running on machines of a data 
processing system including data processing modules (MSO, 
Al through A3) called DAEMONS which run on said 
machines, one of which is a master (MSO), the others being 
agents (Al through A3) the process comprising. 

1) Collecting load data for the machines at first time 
intervals ti on which the applications are running, and 
calculating the total load of each machine from said 
load data and from its power; 

2) At second time intervals Ti, sending the load data from 
agent data processing modules (Al thorough A3) of the 
corresponding machines to a master data processing 
module; 

3) Al said second intervals Ti, centralizing the load data 
of the agents and the master using collection means 
(MRCO through MRC3); 

4) Sending all of said data to each agent (Al through A3); 

5) Copying all said load data from each agent into an 
associated shared memory (MPl through MP3), and 

6) At the moment at which the application (APU) esti- 
mates it will need a particular service to be rendered to 
it, searching in shared memory (MPO through MP3) of 
the machine on which the application (APU) is running 
for the estimated load of each of the machines (MCO 
through MC3), deducing the machine with the lightest 
load at this moment, and then requesting said machine 
to render the service the application requires. 

6. A process for implementing a tool at the service of a 
distributed application APU) runninig on machines of a data 
processing system including data processing modules (MSO, 
Al through A3) called DAEMONS which run on said 
machines, one of which is a master (MSO), the others being 
agents (Al through A3), the process comprising: 

1) collecting load data for machines on which the agents 
and the master (MSO, Al through A3) are running, at 
first time intervals ti; 

2) calculating a total load of a respective machine from 
said data and from the respective machine *s power; 

3) sending the master (MSO) via the respective agents (Al 
through A3), at second time intervals Ti, the load data 
of a respective machine; 
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4) centralizing load data of all the agents as well as load 
data of the master (MSO) via the master (MSO), at said 
second time intervals li; 

5) sending all of said centrahzed load data to each agent 
^ (Al through A3) via the master (MSO); 

6) copying all said centralized load data into respective 
shared memories (MPl through MP3) by each agent 
(Al through A3); and 

10 7) performing the following steps by the application 
(APU) at a moment at which the application (APU) 
estimates the application will need a particular service 
to be rendered: 

a) searching a shared memory (MPO through MP3) of 
^ 5 a machine on which the application (APU) is running 

for an estimated load of each of the machines (MCO 
through MC3), 

b) deducing a machine with the lightest load at said 
moment, and 

c) requesting said machine with the Ughtest load to 
render the application (APU) said service. 

7. The invention as set forth in claim 6 wherein the step 
of calculating a total load is performed by a calculation 

25 program. 

8. The invention as set forth in claim 7 wherein the step 
of centralizing load data is performed by a collection pro- 
gram. 

9. The invention as set forth in claim 7 wherein the step 
30 of sending all of said centralized load data is effected by an 

integral part (MTCO) of said master (MSO). 

10. The invention as set forth in claim 8 wherein the step 
of sending all of said centralized load data is effected by an 
integral part (MTCO) of said master (MSO). 

35 11, The invention as set forth in claim 9 wherein the step 
of sending all of said centralized load data is effected by an 
integral part (MTCO) of said master (MSO). 

12. The invention as set forth in claim 9 wherein the step 
of sending all of said centralized load data is effected by an 
integral part (MTCO) of said master (MSO). 

13, The invention as set forth in claim 6 wherein the step 
of centralizing load data is performed by a collection pro- 
gram. 

^5 14. The invention as set forth in claim 13 wherein the step 
of centralizing load data is performed by a collection pro- 
gram. 

15. The invention as set forth in claim 13 wherein the step 
of sending all of said centralized load data is effected by an 

50 integral part (MTCO) of said master (MSO). 

16. The invention as set forth in claim 14 wherein the step 
of sending all of said centralized load data is effected by an 
integral part (MTCO) of said master (MSO). 

17. TTie invention as set forth in claim 6 wherein the step 
55 of sending all of said centralized load data is effected by an 

integral part (MTCO) of said master (MSO). 

18. The invention as set forth in claim 17 wherein the step 
of centralizing load data is performed by a collection pro- 
gram. 

60 19. The invention as set forth in claim 17 wherein the step 
of sending all of said centralized load data is effected by an 
integral part (MTCO) of said master (MSO). 

20. The invention as set forth in claim 18 wherein the step 
of sending all of said centralized load data is effected by an 

65 integral part (MTCO) of said master (MSO). 
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